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A method of top tagging is introduced. Using the anti-kt algorithm to define jets, events with 
rij = 2 fat jets of cone size R = 1.5 are decomposed into R = 0.6 sub-jets and retained if rij (R— 
0.6) > 4 . One pair of sub-jets reconstructs the W-mass and another jet is tagged as a b-jet, as 
necessary for hadronic or semileptonic events of ti origin. This 'hollow cone' method distinguishes 
' the tt events from the light parton QCD dijet events. 

(N ■ 

• The top quark is important in the Standard Model (SM) because of its the large coupling to the Higgs field. Also, 
top is of great interest in new physics extensions of the SM because one or more new particles commonly decay into a 
single top quark or into top-quark pairs. Examples include SUSY where the stop companion of top may decay to 
' top and the gluino companion of the gluon may decay to top pairs, the Little Higgs Theory 2] where a heavy top may 
decay to top, the Randall-Sundrum model Q where the decays of Kaluza-Klein gluons may lead to an enhancement 
of top signals. Thus, the study of top at the Tevatron and at the Large Hadron Collider (LHC) should be an excellent 
probe of new physics. We focus on the semileptonic channel (one t^- b 1 v decay and one t— >-b q q decay) and the 
Q-i hadronic channel (both tops decay to quarks) ; the leptonic channel with both tops decaying via t ^b 1 v has a clean 
signal but a reduced rate. The expected tt event ratios in the SM are hadronic : semileptonic : leptonic = 0.322 : 1: 
1^ ■ 3.13 . Silicon vertex detectors have enabled the tagging of b-events with high efficiency (~ 50%) using the secodary 
■ vertes algorithm Q at the Tevatron and b-tagging efficiencies of 60% [Ij, 65% are estimated for the ATLAS and 
' CMS detectors. 

' At the Tevatron collider, b-tagging and kinematical selection has been used to identify ttbar events in the fully 
^ hadronic channel 0. For the semi-leptonic channel, the CDF collaboration employs b-tagging, jet multiplicity and an 
isolated electron or muon to do the event selection, and uses a matrix element method to tag a top Q. The DO col- 
OO \ laboration selects events with exactly 4 jets and assigns the jets - 1 jet associated with the lepton and the other 3 jets 
I with the hadronic top decay in a way that minimizes the reconstructed mass difference between the two top quarks @ . 

CN . The LHC is a top factory: in the SM about 8,000 top pairs should have been produced with the 47 pb ^ integrated 
' luminosity already taken per detector at 7 TeV. Since the LHC center-of-mass energy is high compared to the top 
, mass, the tops will typically be highly boosted, so that the decay products are close to each other, as illustrated in 
. . ' Figure [T] Thus, in the detector, at first sight, the top decay products may look like a fat jet instead of the several 
[ separate ones of which it is composed. Butterworth et al. look into subjets inside a fat jet [10|] [ll|, to tag a Higgs or 
a W boson. One issue arises with the fat jet is that soft QCD contamination becomes relevant in the analysis of the 
jet substructure. Several techniques have been developed to deal with that complication. Jet filtering fio'l keeps only 
I several hard subjets, while jet trimming [12] tries to eliminate the soft ones. Jet pruning 13] avoids soft and large 
angle jet recombinations. Following these techniques, there are several algorithms to tag a top. Kaplan et al. [l3| use 
the ratio of transverse momentum of the subjet to that of the original jet (the cut is 0.1 or 0.05) to find out which 
subjet is a " hard subjet " and which is from soft radiation, then apply a cut on invariant masses of the fat jet and 2 
of the subjets, requiring them to be within the mass windows of top and W respectively. Also, a cut on helicity angle 
of the W, which^is the angle between the reconstructed top and one of the W decay products in the reconstructed 
W rest frame [l5|, can be applied. Plehn et al. [l3] use a mass drop criteria to find sub-jets (discussed below ) and 
use a fixed mass window for top and W reconstruction, to tag tops with pr above 200 GeV. There are also several 
top tagging algorithms using other kinematics variables, without taggin g a W or a b, like two-body or multi-body 
kinematics [l3|, separation between lepton and hadronic activities of top 17 1 18 1, and jet mass distribution (T9| . 
Search strategy In the top reconstruction, to catch all the three main decay products of a top instead of a single fat 
jet, it is natural to use a large jet size, in other words, a large R parameter in the jet clustering, otherwise, several 
seperate jets will be constructed instead of one. Here R is the jet size defined by R = \rrf+(j^ where rj is the 
pseudo-rapidity difference of two particles and (f> is their azimuthal angle difference. The lower value of R = 1.5 for 
the top jet is suggested by the the analyses of ref. [lO]. Now consider a top pair. If a large R ~ 1.5 is used, it is 
likely that two jets will be constructed in the event (one from the top and the other from the t) , while more will be 
constructed using a small R. The light jets behave differently from the top jets -the number of reconstructed light 
jets does not vary with R. So after subtracting light jets from dijet events, the top contribution can be seen in the 
variation of the number of jets with cone size R. Figure [5] shows number of jets versus cone size R in events first 
selected as dijets with R = 0.6. The number of top pair events increases with R. Results are for the LHC at 7 TeV 
with 50 pb~^; the SM prediction for the number of top quark pairs is 8,000. Thus, we develop the "hollow cone " 
idea to tag top pairs. Consider the anti-kt algorithm as a " perfect cone " algorithm. When a larger cone size is used. 
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FIG. 1: subjets of a top in -q (jj plane 




FIG. 2: number of top pairs in dijet events 



both a ti event and a QCD dijet event will give two jets, when a smaller cone size is used, a ti event will have more 
jets while a QCD dijet event still have two. This means, for a fat jet with a large cone size, after subtracting the 
small cone in the middle, if there something remains in the hollow cone, the jet is likely to be a top jet, if there is 
nothing in the hollow cone, it is likely to be a light quark jet or a gluon jet. 

Our top tagging algorithm proceeds in the following steps, trying to separate top pairs from the QCD dijet events: 

1 Reconstruct jets using the anti-kt 2l| jet algorithm with R = 1.5 to obtain a set of jets. The number of jets is njets- 

2 Redo the jet reconstruction,with R = 0.6 (or R = 0.7), following recent works of ATLAS ^ and CMS [111, to 
obtain another set of jets. 

3 Keep the event as a ttbar candidate if njets,R=i.5 — 2 and nn^o ^ >2 . 

4 Go into the 2 jets reconstructed in step 1, find all the subjets for each fat jet, using the method described in p^ . 
as follows. For a fat jet of invariant mass of mj, undo the last step of jet clustering to obtain two jets ji and j2, with 
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invariant masses rriji and mj2 ( rriji > mj2 ). If mji <0.9 TOj, keep both ji and j2, otherwise, keep only ji to add to 
the subjet list and decompose further. Add ji to the jet substructure list if mji < 30 GeV, otherwise decompose ji 
iteratively. If the total number of subjets is less than 4, reject the event, because a hadronic top and one semileptonic 
top should give 4 subjets in total, and two hadronic tops will result in 6 subjets. 

5 See whether there is a W inside either of the 2 fat jets, if not, reject the event. To do this, look into a fat jet and 
iterate over all of the 2 subjets configurations. After the jet filtering [^J], if the invariant mass of the 2 subjets falls 
in the window of 65 GeV to 95 GeV, tag that configuration as a W. A similar method of W-tagging is discussed in 
(3, 0, dl, and |26|. 

6 See whether there is a b inside either of the 2 jets by b-tagging; if not, reject the event. 
Any event that survives the above sequence is tagged as a ti event. 

Signal and Background The main backgrounds here are Wbb and Zbb. Since there will be b jets in both cases, and 
the Z mass is close to W mass, these two backgrounds are indistinguishable in their hadronic decay channels. Others 
background are the QCD dijets from light quarks and gluons and QCD multi-jet events. QCD dijets events will be 
gotten rid of by the " hollow cone " cut. QCD trijet events will be eliminated by requiring 4 or more subjets. 

We generate the parton level events with MadEvent[27], then use Pythia 28] to do the parton shower and hadroniza- 
tion. Fast Jet [29II is used to reconstruct jets and analyze the jet substructure. Then we apply the following cuts in 
sequence : 

cut 1 : The "hollow cone" cut. Require njets — 2 and n^eto > 2. 
cut 2 : Total number of subjets > 4. 
cut 3 : A hadronic W can be tagged, 
cut 4 : A b jet can be tagged. 

The cut flow table of signal and background is in Table ID We assume a 0.5 b-tagging efficiency here. The recon- 





cross section (pb) 


cut l(pb) 


cut 2(pb) 


cut 3(pb) 


cut 4(pb) 


tt 


100.00 


12.63 


7.59 


5.39 


4.05 


Wbb 


239.52 


63.93 


1.40 


0.20 


0.18 


Zbb 


124.81 


23.55 


1.20 


0.57 


0.43 



TABLE I: cut flow table for signal and background 



structed W mass and top mass can be seen in figure [3] and HI 

The resulting ratio of hadronic tops to semileptonic tops is 2.81, which is consistent with the ratio of decay branching 
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FIG. 3: reconstructed W mass 

fractions of 3.13. The transverse momentum distribution of the tagged t and t is shown in figure [5] and compared 
with the transverse momentum of the QCD dijet. It shows that the method is picking top jets instead of light jets, 
and also demonstrates that top jets with relatively low pt can be tagged. Former top tagging techniques require the 
top to be harder than 200 GeV [le'l. 

Figure IHl and [7] show the distance in R of W with the nearest b jet for tt and Wbb. The ti events are more likely to 
have a b jet inside the W, while in Wbb the W and b are farther apart, as expected. 

Outlook This method can be also used in identifying new physics that has a top in the final state. The tagged top 
can serve as a distinguishable signature. Also, the " hollow cone " can be used for discovering new, relatively heavy 
and boosted particles at the LHC after taking SM top contribution into consideration. If the new particle has a mass 
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FIG. 4: reconstructed top mass 
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FIG. 5: Normalized pr distributions for top and dijets 



is not very close to the top mass, the mass of the new particle can be reconstructed by the invariant mass of the 
filtered subjets. 
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FIG. 6: Distance in R between the W-boson and the nearest b jet in tt 

events. 
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FIG. 7: Distance in R between W and the nearest b jet in Wbb 

events. 
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